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Abstract 

Suppose that we are given a quantum computer programmed ready to perform a 
computation if it is switched on. Counterfactual computation is a process by which the 
resuh of the computation may be learnt without actuaUy running the computer. Such 
processes are possible within quantum physics and to achieve this effect, a computer 
embodying the possibility of running the computation must be available, even though 
the computation is, in fact, not run. We study the possibilities and limitations of 
protocols for the counterfactual computation of decision problems (where the result r 
is either or 1). If pr denotes the probability of learning the result r "for free" in a 
protocol then one might hope to design a protocol which simultaneously has large po 
and pi . However we prove that po + Pi ^ 1 in a-ny protocol of this type and derive 
further constraints on po a-nd pi in terms of N , the number of times that the computer 
is not run. In particular we show that any protocol with p^ + pi — \ ~ e must have N 
tending to infinity as e tends to 0. We show that "interaction- free" measurements can 
be regarded as counterfactual computations, and our results then imply that N must 
be large if the probability of interaction is to be close to zero. Finally, we consider some 
ways in which our formulation of counterfactual computation can be generalised. 
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1 Introduction 

There is a set of remarkable phenomena that seem to be special to quantum mechanics. 
Their common theme is counterfactuality: the fact that an event might have happened 
enables one to obtain some information about that event, even though it did not actu- 
ally take place. Examples of such counterfactual phenomena include the Elitzur-Vaidman 
bomb testing problem (Elitzur &: Vaidman 1993, Penrose 1994) and the use of so-called 
interaction-free measurements (Kwiat et al. 1995, Renninger 1960, Dicke 1981, Geszti 
1998) to determine the presence or absence of an object by means of a test particle, even 
though no "interaction" may have occurred between the object and the particle. With 
some protocols, the probability of an interaction occurring can be made arbitrarily small. 
By carrying out a raster-scan of such measurements, it is possible to form an image of an 
object in an interaction- free manner (White et al. 1998). 

One can extend these ideas and imagine the object being replaced by a quantum com- 
puter (Jozsa 1999). In that case, it turns out that one can determine the outcome of a 



computation without the computer ever being switched on. This is counterfactual compu- 
tation. In the quantum formahsm the computer may be in a superposition of being on and 
off, and wc will need to clarify the sense in which the computer is "not switched on" . This 
is the content of our precise definition of counterfactual computation given below. There 
are many possible protocols for counterfactual computation which give various probabilities 
of gaining information 'for free'. The aim of this paper is to show that there are limits 
on these probabilities which hold for all possible protocols; there are also limits for pro- 
tocols depending on how often the computer is given the opportunity to run (in a sense 
to be made precise). As we shall see, counterfactual computation includes interaction- free 
measurement as a special case, and our limits then translate into statements about the 
probability of interaction. 

It is well known that quantum physics has a profound bearing on issues of computation 
as described for example in (Bernstein &: Vazirani 1993, Deutsch 1985, Deutsch &; Jozsa 
1992, Ekert & Jozsa 1996, Grover 1996, Simon 1994, Shor 1994). Much of that work is 
directed towards questions of computational complexity, in particular devising quantum 
algorithms that exhibit an exponential speedup over any known classical algorithm for the 
computational task. Counterfactual computation is a novel quantum computational effect 
of an entirely different sort. It does not involve a speedup but highlights in a particularly 
poignant way, some of the interpretational enigmas of quantum mechanics and its theory of 
measurement. It remains to be seen whether it can be exploited for real practical benefit. 



2 Defining counterfactual computation 

We begin by formalising the idea of counterfactual (henceforth abbreviated to CF) compu- 
tation. Consider a quantum computer with an 'on-off ' switch programmed ready to solve a 
decision problem when the switch is turned to 'on'. This computer has an output register 
that represents the binary result of the computation. The switch and the output register 
will be denoted by a pair of qubits 5(8)0, with S the switch qubit and O the output register. 
The switch states 'off' and 'on' are labelled respectively as |0) and |1). 

The computer will generally require extra storage qubits for its programming and extra 
working qubits for its computational processing. If TZ denotes the state space of these extra 
qubits then the total state space of the computer is »S (g) O (g) 7?.. As is customary in idealised 
quantum computation, we will assume that the computer operates in a fully reversible 
manner and that the initial and final states of TZ are equal. Also the binary result r of the 
computation is deposited in the output register qubit as an addition modulo 2. Thus the 
computational process may be written 

|0) |j) \R) ^ |0) \j) \R) (computer 'off') 

|1) \j) ^ |1) \j ®r)\R) (computer 'on') ^ ^ 

and it is completed in some finite known time T. 

If the computer is switched on, after a time T it will have carried out one of two unitary 
operations Uq or Ui on S fS> O corresponding to the two possible outputs r = or r = 1. 
Here Uq is the identity on the switch and register qubits and Ui is the C-NOT operation on 
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these two qubits (cf eq. @)). Thus Uo takes |10) into |10) and Ui takes |10) into |11) but 
if the computer is not switched on then the evolution is the same for both possible values 
of r (in fact, being then Uq). 

We will assume that the computer is given as a device with the switch and output qubits 
S O initially set to |0) |0). If the switch is set to |1) the computer will effect either the 
transformation Uq or Ui and we wish to determine which it is, without switching it on. We 
will assume that we are unable to access the qubits in TZ. 

We begin with a row of qubits (as long as required) all initialised in a standard state 
|0). A protocol consists of a sequence of steps where each step is one of the following: 

(a) A unitary operation (not involving the computer) on a finite number of specified qubits. 

(b) A measurement on a finite number of specified qubits. 

(c) An "insertion of the computer" , where the state of two selected qubits is swapped into 
the registers 5 O of the computer, a time T is allowed to elapse and finally the states are 
swapped back out into the two selected qubits. 

The steps of the protocol are implemented by selecting the designated qubits from the row, 
applying the operation and then returning the qubits to their positions in the row. The 
whole protocol may be thought of as a network of operations with the computer being 
inserted at various places via (c). The same Ur is used throughout the protocol. The out- 
comes of the measurements in (b) are the source of our information about the computation 
and its result. 

To understand how a protocol works and to provide a definition of counterfactual com- 
putation, we introduce a notation which records whether the computer was on or off on 
the various occasions when it was inserted. At the beginning of any such step the entire 
state space can be partitioned into two orthogonal subspaces, the 'off' and 'on' subspaces, 
corresponding to the switch states |0) and |1), respectively, and we will separate the total 
state into its two superposition components in these subspaces. 

Imagine that, after each insertion of the computer, we carried out a measurement that 
projects into these subspaces, with outcomes which we write compactly as / (for off) or n (for 
on). Then each possible list of //n outcomes, together with outcomes of the measurements 
in the protocol, defines what we shall call a history. One can depict this imaginary protocol 
as a branching structure. At each node we give the un-normalised state vector, the result 
of the projections occurring at each measurement step of the protocol or at the imaginary 
measurement following an insertion of the computer. If Vn denotes the un-normalised state 
vector at the final node of the history h, the probability of h is The initial node is 

assumed to correspond to the specified initial state of the row of qubits. Note that there 
will be a different branching structure depending on whether Uq or Ui is being used. 

In the real protocol, we do not necessarily carry out the f/n measurement (though of 
course a protocol could include such a measurement). Thus in contrast to actual measure- 
ment steps of type (b), the unnormalised state vectors at the f/n branchings still retain the 
information of relative phases between them i.e. the branching gives two coherent superpo- 
sition components of the state vector. Then we can utilise the branching structure of the 
imaginary protocol to compute probabilities for the real protocol as follows. Let m denote 
a sequence of measurement outcomes, one for each measurement step of type (b) in the 
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protocol. There will be various histories h that include those outcomes (we write m C h). 
For instance, in Example 1 below, with Uq the histories /O/OO and nO/00 both include 
the measurement outcomes m = 0, 00. (Here the first measurement in the protocol is of 
one qubit, giving the '0', whereas the second measurement is of two qubits, giving '00'; see 
Figure 1). The probability of m is just | J2mch''^h\'^ , the sum being taken over all histories 
including m. The contributions from these histories parameterised hy f /n sequences, are 
added coherently as the f /n branchings do not correspond to actual measurements. (If an 
f /n measurement is actually performed, its designated outcome will be listed in m instead). 

Note that our use of the term 'history' differs from the more conventional use (e.g. in 
the so-called consistent histories approach to quantum mechanics) . In the latter, every node 
is viewed as a measurement and different paths (histories) are always added incoherently 
(i.e. as a sum of squares rather than a square of a sum) in computing the probability of a 
sequence m of specified outcomes of a subset of all the measurements performed. 

We are now ready to define a CF computation. Let m denote a sequence of measurement 
outcomes. We say that m is a CF outcome of type r , where r = or 1, if a history h including 
m satisfies Vh ^ with Ur only if h is ail-off, and if the probability of m with Ui-r is zero. 
This means that if we observe the measurement sequence m then we can infer that the 
computation result is certainly r even though the computer has not run at any stage of the 
protocol (since m can only occur via an ail-off history). More formally we have: 

Definition: m is a CF outcome of type r, where r = or 1, if 

1) With Ur, Vh = for all h with m C h, except for that h that has only /'s (in addition to 
m). 

2) With Ui-rj Y.h.rnch"^'h = 0, the sum being over all h that include the measurement 
outcomes m; i.e. with Ui-r, ^ is seen with probability zero. 

Let us pause to elaborate on the sense in which "the computer has not been run at 
any stage" if the CF outcome sequence m is observed. The reader may object that the 
computer has certainly been in a superposition of 'off' and 'on' and in this sense, has 
actually been run! But the sequence m can arise only via all-off histories so if it is seen 
then we have been confined to a part of the total quantum state in which the computer is 
never run. For example, in the language of the many worlds interpretation, we will have 
evolved in a world in which the computer was never run yet in that world we learn the 
result of the computation. The fact that the computer may have run in another world is 
of no consequence for us. The validity of this point is especially emphasised in the original 
Elitzur-Vaidman bomb testing problem (Elitzur & Vaidman 1993) (which we will later see 
as a special case of our CF formalism): there is always a world in which a good bomb will 
explode but (with a suitable measurement outcome) we are confined to another world in 
which the bomb is left unexploded and yet we have the knowledge that it is, in fact, a good 
bomb, available for future explosive applications. 

This issue is at the heart of the so-called measurement problem of quantum theory. 
Consider the proverbial Schrodinger cat in state -^(1 alive) + |dead)). What happens upon 
a measurement of alive versus dead? Suppose that the outcome is seen to be "alive". 
Then in the conventional "collapse of wavefunction" formalism, the state of the cat changes 
discontinuously into | alive) and the |dead) component ceases to have any further physical 
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reality or existence or any further consequence. In a many- worlds or decoherence view 
(Zurek 1991) of the measurement process both components |alive) and |dead) persist and 
may be thought of as existing in separate "parallel" worlds. The measurer similarly passes 
into all these parallel worlds, registering respectively all possible measurement outcomes. 
Since | alive) and |dead) are macroscopically distinct states they rapidly interact with many 
other subsystems in the external universe, spreading orthogonality. This process is the 
decoherence of the initial superposition which we symbolically write as: 

|dead)|0)|0)...|0) ^ |dead)|0)|0)...|0) 

lalive) |0) |0) . . . |0) ^ |alive) |1) |1) . . . |1) ^ ' 

Here |0) and |1) are orthogonal states and the (very long) string of kets after the leftmost ket 
represent a large number of subsystems in the universe. Hence the two worlds or branches 
will never again interfere in practice - re-interference would require an enormous correlated 
effect to undo the widespread interaction in eq. (^) and is extremely improbable. Hence 
although the cat exists dead in the other world, we in this world (having perceived the 
outcome "alive") will never have to bother about or suffer any consequences of that other 
unfortunate outcome. In either interpretation we would be happy that the cat continues to 
exist fully alive, despite that fact that it was in a strange superposition in the past. 

In a similar way in our definition of CF outcome, the desirable option (like cat alive) is 
the computer being off and the undesirable option (like cat dead) is the computer being on. 
Although we pass through a superposition of these states, the observation of a CF outcome 
means either that we have collapsed the state to an entirely "off" part of the total state 
and the "on" part ceases to have any further physical reality or consequences, or that we 
are in a world in which the computer has not run, yet in both cases we also have the result 
of the computation. 

We note also that neither of the interpretations of the measurement problem described 
above is really satisfactory: in the first, no mechanism is given for the discontinuous "col- 
lapse" of the state and in the second, no explanation is given of why we actually perceive 
just one of the parallel worlds upon making a measurement - it is difficult to present tan- 
gible physical evidence for the existence of the other worlds. 

Example 1 

The following example is given in (Jozsa 1999) but we re-express it here in terms of our 
formalism. The only variables that concern us are the switch and output register qubits. 
Starting with |00), the state is rotated to cos0|OO) + sin^|lO), where 6* = 2^ for some integer 
A^. A time T is then allowed for the computer to run, if it is going to. This gives the state 
cos^jOO) + sin0|lr), assuming Ur is used (r = 0, 1). The second qubit is now measured. If 
r = 1, the measurement either gives |00) with probability cos^0, or |11) with probability 
sin^O. If the latter occurs, we know that r = 1 and the computer has run; the protocol 
therefore halts. If r = the measurement yields cos0|OO) + sin0|lO), and we repeat the 
preceding steps, rotating by 9, allowing the machine to run, then measuring. The branching 
structures for Uq and Ui are shown in Figure |l[ 

After A^ repeats, if r = the state will have rotated to |10) with certainty. If r = 1, 
the state will be |00) with probability cos(7r/2A^)^^. In this case the computer has not run. 
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loo) — ►c|00>+ s|10> 
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"0^ 110)^ -s|00)+ c|10) 
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second bit 



measure 
both bits 
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111) 



"0';^ 100)-^ cioo)+siio)' 

/ 



100) 
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/ 



100) 
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Figure 1: The protocol for Example 1, for the case N = 2^ shown as a branching structure, 
as described in the text. The thin boxes denote insertions of the computer and the dotted 
vertical lines measurements. Here c = cos 6 and s = sm.6. Arrows are labelled by / or n, 
and by measurement outcomes (these are all shown in inverted commas). The sequence 
of labels defines the history associated to a path. For instance, the uppermost path with 
C/i, marked with heavy arrows, is the history /O/OO. At each node the un-normalised state 
vector is shown. Under Ui, this is the only history including the measurement outcomes 
m = and 00, and it is an all-/ history, so condition 1) for a CF outcome of type 1 is 
satisfied. Under Uq there are two histories including m, namely fOfOO and nO/00, and 
the sum of their final un-normalised state vectors is (c^ — s'^) 1 00). If we chose 6 = 7r/4 so 
(? — = 0, the probability of m under Uq is zero, so condition 2) for a CF outcome is 
satisfied. 
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yet we know that r = 1. This is therefore an example of a CF computation. (In Figure ||, 
the history that gives the CF result is shown by bold arrows.) By making large enough, 
cos(7r/2A^)^^ can be brought arbitrarily close to 1. Thus if r = 1 we obtain the CF result 
with a probability approaching 1. 

Example 2 

We describe next a protocol where both types of CF outcome occur (see Figure ^). In 
addition to the switch and output qubits, we require a third, ancillary, qubit. To start with, 
the initial state |000) is rotated to c|000) + s|100), where c = cos 9, s = sin 9. Next we in- 
sert the computer on the first two qubits and finally measure all qubits in the following basis: 

\xi) = t(s|000) - c|100) + s|001)), 
\X2) = t(s|000) - cjllO) - sjoOl)), 
|X3) = n(c|000) + 2s|100) + c|001)), 
\xa) = n(c|000) + 2s|110) - cjoOl)), 
|X5> = |010), |X6) = |101), 
\xj) = |011), Ixg) = 

where t = 1/Vl + s2, u = l/v'2 + 2^2. There are two CF outcomes, |rEi) with r = 1 
and \x2) with r = 0. They satisfy our two conditions, namely: 1) the history leading to |a;i) 
under Ui has only 'f's, as does that leading to \x2) under Uq, and 2) has probability 
zero under Uq as does \x2) under Ui. The probabilities of these CF outcomes are both 
{cst)^. This is maximised by = 2 — which gives (cst)^ = 0.172, or Pq + pi = 0.344. 



3 Limits on count erf actual computation 

Let us write p^, pi for the probabilities of getting a CF outcome of type or type 1, 
respectively in any given protocol. Example 1 shows that there is a protocol for which pi 
approaches 1, but in this case po = 0. Could one devise a protocol that allowed bothpo and 
pi to approach 1? Less ambitiously, can one devise a protocol giving po + pi > 1? These 
questions were posed in Jozsa (1999), and we answer them here. 

First note that, since the subdivision into on-off subspaces corresponds to an orthogonal 
decomposition of the entire state space (and the same is true of each measurement step of 
type (b)), we have, for each Ur 

Y.\-k\' = l. (3) 

h 

In Figure 1, for instance, under Uq we have J2h b/iP = + c^s^ + + c^s^ = 1, for any 
9. This is true despite the fact that the final states of the histories may not be orthogonal 
for the original protocol e.g. |00) and — |00) for Uq in figure 1; we are using the fact 
that each step in the histories is either a measurement in the protocol, or can be regarded 
as one (as in our imaginary protocol). 

This basic fact may be alternatively expressed as follows. We imagine a supply of extra 
qubits, as many of them as there are insertions of the computer in our protocol. If we carry 
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^4 



Figure 2: The protocol for Example 2. Here the first two qubits are, as usual, the switch 
and output register, and the third qubit corresponds to an ancilla. The only vectors in the 
x-basis relative which have non-zero probability of being final outcomes are those shown in 
the figure. The bold arrows show the histories that give CF outcomes; there is one of type 
and one of type 1. 
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Table 1: One CF outcome, Mi. 





Uo 


Ui 


Ml 





off only 


M2 


X 


X 









out a C-NOT operation between the switch qubit and the n-th extra qubit at the n-th 
insertion, the resultant states at each branching of the imaginary protocol will be mutually 
orthogonal in the extended space. Hence the branching tree corresponds to a probabilistic 
process, and eq. (|3|) expresses the certainty of following some path through the tree. 

Now we know from condition 1) above that the ail-off history h alone determines the 
probability of the CF outcome. But, for an ail-off history /i, Vh must be the same 
whether Uq or Ui is used, since the computer is never switched on! Thus, in forming the 
sum po+pi, i.e. in collecting terms for all possible CF outcomes, one can assume that 
all of these terms come from the list of v^s for one of the Ur, say Uq. Note that condition 
1) implies that there is no possible overlap: one can only get a CF outcome with one of the 
Ur for a given m. 

Thus po + Pi = J2heJ^ where the sum is over the set J- of ail-off terms from CF 
outcomes. These terms can be assumed to be derived using Uq, so equation (^) implies: 
Theorem 3.1. po + Pi < 1- 

This answers the conjecture in Jozsa (1999). We now look at some subsidiary questions. 

In Example 1, one can only approach the bound of 1 closely by having a large number, 
A'", of insertions of the computer. Could there be protocols which reach the bound of 1, 
or come close, with only a few insertions? The following argument shows that this is not 
possible. 

Consider first the situation shown in Table |l|. There is only one CF outcome, and we 
ask how many insertions of the computer we need to get the probability of this result, pi, 
to satisfy pi = 1 — e. 

Let h{i) denote a history consistent with measurement outcome sequence Mj. Under Uq 
we have: 

Ei^Mi)i' = i- E bMoi'<i- 

h{l) h{i),i>2 

Let F(l) denote the all-/ history consistent with Mi. Then 

Pi = I'^FCl)!^ < 1 - E 

h(l)^F(l) 

SO, 

E Kii)f<e. (4) 
On the other hand the total un-normalised state vector (under Uq still) for Mi is zero, so 

E^Mi)=0- (5) 

HI) 
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Table 2: Both CF outcomes: r = 1 (Mi) and r = (M2). 







Ui 


Ml 





off only 


M2 


off only 





M3 


X 


X 









Thus 



and hence 



/x(l)^F(l) 



1 - e. 



(6) 



(7) 



Now, for any vectors Xi, the minimum of J2iLi l^^ip given | J2iLi Xil"^ = S is S/K. There 
are 2''^ histories consistent with Mi for N computer insertions, so 



E \vhii)\^>{l-e)2- 



Using equation (^), (1 — e)2 ^ < e, or 

N > log2 



1 - e 



(8) 



Consider next the situation shown in Table |2|. Here both types of counterfactual result 
occur, and we assume po + pi = 1 — e. Under Uq we have 

\vF(l)\^ + \vFi2)\^+ E H{1)\^<1. 

h{l)^F{l) 

Since po = 1 1^^(1)1^ and pi = |f_p{2)Pi we obtain as before. Furthermore, equations @ 
and (^) hold, but now the equivalent of (|^ is 



E 

hil)^F{l) 



Pi- 



(9) 



We therefore obtain, in place of (|8|), > log{pi/e). On the other hand, under ?7i we have 
N > ^og{po/e). Since max(po>Pi) > (1 — e)/2 we conclude that 



N > log2 



1 - e 
2e 



(10) 



In Tables |l] and ^ only one CF outcome of each type is shown. However, no real extra 
generality is gained by allowing several CF outcomes of the same type. To see this, note 
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that in each measurement step of type (b), instead of cohapsing the state, we may leave 
it entangled with an auxiliary system representing the measurement pointer. Thus each 
measurement step becomes a unitary operation of type (a). Finally at the end of the 
protocol we read all of the pointers together. This may be viewed as a single measurement 
whose possible outcomes are the possible measurement sequences of the original protocol. 
All CF outcomes of one type can then be grouped into a single final measurement outcome. 

We summarise these results as follows: 
Theorem 3.2. The number of insertions N must tend to infinity as po + pi tends to 1. 

It is possible that the inequalities and (|l^) can be sharpened. This is certainly true 
when A'^ = 1. Consider first the situation where only one type of CF outcome occurs (Table 
Hj). The only possible histories are fi or ni, where i denotes outcome Mj. With C/q, @ 
gives 

^ \Vfi\'^ + = 1. 

i 

Since f/i + fni = we have 

2\vfi\'' = l- J2 {\vM^ + \Vm\^)- (11) 
i>=2 

We can also add the un-normalised state vectors leading to Mi, M2, etc, and consider just 
the projections onto the subspaces of the final measurement. This gives: 

\Vfi+Vni\'' = 1, (12) 

i>=2 

where the sum goes from i = 2 because Vfi +Vni = 0. From equations (11) and (12) we get 

pi = \vfi\^ = 1/4 - 1/2 ^ \vf, - Vm\' < \. (13) 

i>=2 



For A = 1, (^) implies e > 1/3, or pi < 2/3, so ( [131) is an improvement; in fact, (|13|) is best 
possible because Example 1 with N = 1 gives pi = 1/4. (More precisely we would need an 
extra final rotation step, omitted from figure |^, at the end of the protocol before measuring 
both bits.) 

Consider next the case where both types of CF outcome occur (Table I). With Uq, d) 
gives 

+ bm|^ = 1- 

i 

and now f/i + Vni = and Vn2 = 0, so 

2b/iP + \vf2\^ = 1 - E + l^-l') ■ (14) 

j>=3 

Again, we can add the un-normalised state vectors leading to Mi, M2, etc, which gives: 

l^'/sl^ = 1- E \Vfi+Vni\'^, (15) 
i>=3 
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Equations (|l^ and ([T5[) imply 

i>=3 

With Ui we get 

|t;/i|^ + 4|-u/2p = 1 - E Ivfi-Vnil"^- 

i>=3 

the tilde in Vni distinguishing the state vector with Ui from that with Uq. Adding the last 
two equations gives: 

^b/iP + 5|^^/2p = 2 — (positive terms), 

so 

Po+Pi<2/5. (16) 

For = 1, (|lO|) implies e > 1/5, or po + Pi < 4/5, so ( |l6|) is an improvement. We do not 
know, however, whether it is best possible. In Example 2, po + pi = 0.344. Can we attain 
the bound of ([l^ ) with some other protocol? 

4 'Interaction-free' measurements 

Our formalisation of counterfactual computation provides a general framework that in- 
cludes 'interaction-free' measurements (IFM) (Elitzur & Vaidman 1993, White et al. 1998, 
Vaidman 1996) as a special case which is characterised as follows. Suppose we have an 
apparatus and an object. We interpret the switch qubit as defining configurations in which 
the apparatus cannot interact with the object (switch=|0)), or can (switch=| 1)), and we 
interpret the output register qubit as defining the state of the object, with |0) signifying 'no 
interaction' and |1) 'interaction has occurred'. The additional constraint that defines an 
IFM is that each insertion of the computer, now viewed as a potential interaction between 
apparatus and object, is followed by a measurement of the output register. If this is |1), an 
interaction has occurred and the protocol halts. In contrast, in a general CF protocol, the 
output register may be set to |1) also by other unitary steps (not involving the computer) 
and the |1) branches may be used in subsequent interferences. 

In an IFM protocol we interpret Uq as the computation corresponding to the object 
being absent, and Ui to that when it is present. Thus pi is the probability of an IFM 
occurring. As for pQ, it can be interpreted as the probability of the system being confined 
to the non-interactive configuration when the object is absent. This is a rather artificial 
concept, and theorem 3.1 does not translate into a very meaningful result. More interesting 
is the implication of theorem 3.2: 

Theorem 4.1. The number of times N an interaction with an object does not occur must 
tend to infinity as the probability of an IFM tends to 1. 

Our Example 1 can be interpreted as an IFM, since the output register is measured 
after every insertion. Example 2 cannot be so interpreted, however, since the measurement 
following the insertion leaves the output register in a superposition of the states |0) and |1). 



12 



5 More general CF computation 



Our definition of CF computation may be generalised in various natural ways such as the 
following. We may be given more than two unitary operations Ur on a product space S<SiO, 
whose parts may be larger than just qubits. Each operation Ur is required to have and "on" 
and "off" subspace defined by an orthogonal partition of S and Ur is the identity on its 
"off" subspace. Given an unknown one of the Ur's we wish to determine r while being 
confined to its "off" subspace. 

We now describe an explicit example of this type of generalisation which allows the 
switch space S to have an arbitrary number of dimensions, and which also allows more than 
two operations Ur, but O is still taken to be a qubit. We consider later in more detail the 
case where there are only two f/^'s but an arbitrary dimensional switch space, and prove an 
analogue of theorem 3.2. 

Our example is motivated by thinking of an interferometer in which the incoming photon 
is split into an equal superposition of K arms, numbered to X — 1, with absorbing objects 
blocking all the arms except for arm and arm r (r ^ 0). The problem is to identify the 
value of r without an interaction occurring. Our protocol below will also be an IFM in 
the sense of section 4 i.e. the protocol measures the output register after each potential 
interaction and halts if an interaction is seen to have occurred. However this IFM problem 
differs from standard IFM, where no prior assumption is made about the object being 
present or absent; here we consider a special set of configurations of objects. 

More formally we consider a K dimensional switch with state space S spanned by 
{|0) , . . . ,\K — 1)}. The output register O is still a qubit and we define K — 1 unitary op- 
erators Ur for r = 1, K — 1 on S O by: 

Ur{\ij)) = Hi = 0,r (the 'off '-subspace), 
Ur{\ij)) = — j), if ? 7^ 0, r (the 'on'-subspace). 

Thus Ur corresponds to arms and r being unblocked. The "off" subspace for Ur (on 
which Ur is the identity) is span(|0) , |r)) (g) O while the "on" subspace (on which Ur negates 
the output qubit value) is the orthogonal complement. Given an unknown one of the J/^'s 
wc wish to identify r while confining ourselves to its "off" subspace. More formally we 
modify our original definition of CF outcome as follows: 

Definition: m is a CF outcome of type r if 

1) With Ur, Vh = for all h with m C h, except for that h that has only /'s. 

2) With Us, s ^ r, m is seen with probability zero. 

Consider now the following protocol. Starting with |00), we apply to the switch a unitary 
transformation R given by 



K-l 



i?(|00)) = a|00) + b\iO) , 



1=1 



and 



K-l 



(K-l) 



i?(|jO)) = -6|00) + |jO)+ ^ 



iO),j /O. 



1=1 
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where + {K — 1)6^ = 1. We now insert Ur-, then measure the output register. If it is 
|1) the protocol terminates; otherwise the protocol continues by repeating the above. After 
7r/26 steps|^ the protocol halts. We may show (cf the geometric interpretation of the process 
below) that as h decreases, the probability of interaction occurring decreases. In a way that 
is similar to example 1, for small enough Kh the final state is, with high probability, |rO). 
According to our definition above, this will then be a CF outcome of type r. 

The workings of the protocol may be understood in terms of a geometrical interpretation 
of its operations. Consider first the case K = ?>. Looking at the switch variable, R is just a 
rotation in the plane of |0) and |1) + |2), i.e. about the axis |1) — |2) which is orthogonal to 
those two states. To see this just note that R is some rotation (as columns are orthonormal 
and det(i?) = 1) and R leaves 1 1) — 12) fixed. The angle of rotation is given by cos Q = {v\Rv) 
for any choice of v in the rotation plane; taking v = |0) gives cosO = a. The protocol can 
then be thought of as follows: R repeatedly applied gradually pushes the state |0) around 
to the equal superposition of all the other basis states, except that after each incremental 
rotation R, the state is projected into the 2D space of |0) and |r) (so long as the output 
register is always measured to be |0)). It is clear that |0) will be pushed around to |r) and 
we get Pr near 1 for all r's if b is sufficiently small, keeping the state always close to the 
rotation plane. This 2D motion is not a simple rotation because of the repeated projections 
(which do not commute with R). If the output register is ever measured to be |1), the state 
has been projected orthogonal to the rotation space and we abort the process. 

For general K the operation R on the switch variable is just the same rotation in the 
2D subspace of |0) and |1) + |2) + . . . |Er — 1), being the identity in the orthogonal (K — 2)- 
dimensional complement C. To see this note that R leaves invariant all — 2 vectors of 
the form |1) — |r), r = 2, . . . , K — 1. These vectors are all orthogonal to both |0) and 
|1) + |2) + . . . |if — 1) so they span C. The angle of rotation is as given above and again we 
get a 2D motion in the plane of |0) and |r). 

Now consider K = 3 again: As in our original definition of CF computation, there are 
two operations, here Ui and U2, but now the switch variable's dimension is three (i.e. the 
switch is a "qutrit"). Both pi and p2 have a natural interpretation, (the probability of an 
IFM if an object is in arm 2 or 1, respectively), so one might hope that theorem 3.1 would 
yield a meaningful result. However, the theorem (stating that pi + P2 < 1) evidently fails 
in this generalised setting since both pi and p2 tend to 1 for small enough b. The reason 
for this failure is that the proof assumes that the two Ur^s have the same 'off' subspace. 
This was true of the original C/^'s but is not true in the present case as the switch qutrit 
is |0) or |1) in the 'off' space of Ui, and is |0) or |2) in the 'off' space of U2- We therefore 
cannot hope to prove an equivalent of theorem 3.1. However, we show next that a version 
of theorem 3.2 does hold, namely that the number of steps must be large for po + pi to 
approach its upper bound. This bound is 2 in the present example rather than 1. 

This new theorem holds not only for IFMs, but for any CF computations, given certain 
restrictions on the 'on'- and 'off '-subspaces of the C/^'s. We label the two Ur's as Uq and 
Ui again, but now allow any finite dimensional switch space S. We assume that S is parti- 

^More precisely, the number of steps that yields the state closest to |rO) can be calculated for given K 
and b, and tends to it /2b as Kb tends to zero. 
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tioned into orthogonal 'on'- and 'off '-subspaces for each of Uq and Ui, and we also assume 
that these decompositions are compatible in the sense that the subspaces 

Sa= [Uo 'off' & Ui 'on'], 
Sb= [Uo 'on' k Ui 'off'], 
Sf = [Uo 'off' & Ui 'off'], 
5„= [Uo 'on' & [/i 'on'], 

span the whole of S. We also assume that Uo is the identity on Sa and Sf 0, and ?7i 
is the identity on Sb and Sj O. With these assumptions we have 
Theorem 5.1. T/ie number N of insertions must tend to infinity as po + pi tends to its 
maximum value. 

The proof is given in the Appendix. Note that the theorem applies to the 3-armed 
interferometer as iSa = {|1)}, iSf, = {|2)}, 5/ = {|0)} and <S„ = 0, and these subspaces span 
S. Here there are two operations Ur, but it seems likely that theorem 5.1 can be generalised 
to any number of operations satisfying appropriate compatibility conditions on their 'on' 
and 'off' subspaces. 

6 Discussion 

Our starting point was the enticing notion of being able to run a quantum computer 'for 
free'. The quotation marks here were very necessary, however, for it is not at all clear 
what if anything comes for free. A CF protocol requires the computer to be present, and 
due time must allowed for the machine not to run. If the computer does not run, then 
wc might expect to gain protection against decoherence. However, this appears not to be 
the case because our computer has to eschew all dissipative processes if running it is to 
constitute a unitary operation (as is necessary in a CF protocol to correctly generate the 
destructive interferences which allow us to conclude that an outcome is CF for a value of 
r), even though it might not actually be run. Perhaps the computer owner could determine 
whether the machine had run and levy a charge accordingly; we would then at least make 
a financial gain from a CF computation. Suppose, for instance, at each step where there 
was an 'on'-'off' choice the owner arranged for the switch state to be entangled with an 
extra qubit. At the end of the run he could measure the extra qubits to see what path the 
computation had taken. However, it is easy to see (in example 1 for instance) that this 
measurement generally destroys the interference that gives rise to a CF outcome. 

Thus the naive notion of gain from a general CF computation has to be abandoned. 
It does not follow, however, that CF computation is a meaningless abstraction. As we 
have seen, the framework includes 'interaction-free' measurement, where (despite the quo- 
tation marks) the practical gain is real, since it includes such potential benefits as X-ray 
images with reduced radiation damage (Vaidman 1996). By making the definition of CF 
computation sufficiently broad, we might hope to capture other types of interesting counter- 
factuality. In the last section we took some steps towards generalising our initial definition. 
We conclude by suggesting two further steps in this direction. These are both probabilisitic 
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extensions of CF computation. 

First, suppose the output register is extended to K + 1 values, and that running the 
computer takes the state |10) \x) to |lr) ® \x) under Ur, for r = 0, . . . , K , where \x) = 
|0 . . . 0) in a 2-^-dimensional ancillary Hilbert space. By analogy with Example 2, one can 
rotate |00)(gi|x) to (c|00)+s|10))(X'|j;), insert the computer, then measure in a basis including 
\vr) = t(s|00) — c|lr)) (g) \x) + ts|00) (8) \yj-), where the \yr) are vectors from the barycentre 
of a i^-simplex to its vertices satisfying {yi\yj) = —1, = K and t = l/\/l + Ks'^. If 
we see the outcome corresponding to \vs), we know that the output register did not take 
the value s, and also that the computer did not run. Thus we have some counterfactual 
information, but do not know precisely what value the output register took. However, given 
prior probabilities for the Ur, we can compute posterior probabilities for output r. 

The other type of probabilistic extension is obtained by relaxing the conditions (1) and 
(2) in the definition of CF computation. We may define a measurement outcome sequence 
m to be "approximately CF for r" if it has the following properties. For some (small) e: 
(1') With f/r, E l^hP < e where the sum is over all histories consistent with m except for 
the ail-off history (the latter history having probability significantly larger than e). 
(2') With Ui-r, m is seen with probability less than e. 

Hence if m is seen then with high probability the computational result is r and also with 
high probability, no computation has been done. 

Appendix A. 

We give here a proof of theorem 5.1. 

Proof. The upper bound here is 2, so suppose po + pi = 2 — e, for small e > 0. 
We decompose 5 into subspaces Sr^O, where r = a,b, /, n, and label each component 
before an insertion with the appropriate subscript. Just as we previously wrote histories as 
chains of /'s and n's (and measurement outcomes, which we suppress), we can now write 
histories as chains of a's, 6's, /'s and n's. We have 

where s ranges over all strings composed of a's and /'s, and Vg is obtained under Uq. 
Similarly for pi we have, 

with 5's and /'s, under Ui. 



Now consider a protocol with N insertions. After the first insertion ( |T7| ) and (18) imply 
Po < \va + Vf\^ = |uaP + b/P, (under Uq) 
Pi < \vb + ^/P = bfeP + b/P) (under Ui). 
Also, 

\Vaf + \vi,f + \Vff < 1, 

where the first term is derived under Uq, the second under Ui and the third under ei- 
ther. This holds because the Ur are all the identity on the subspaces in question. These 



16 



inequalities imply \vf\'^ > po + pi — 1 and |faP + l^feP < 2 — — Pi) which implies 
\va\ + \vb\ < - 2po - 2^*1. Since po + pi = 2 - e, we get 
\vf\^ > 1 - e. 

\va\ + \vb\ < V^e- 

Note that the first inequality implies 5/ is non-empty. 

Now look at the situation after n insertions. From ([T7| ) we have, under Uq: 



k=n-2 

Po<\ H Vas"~'=-i +«/"-ia + (19) 

fc=0 s 

Here f^as'^~^~^ denotes a history beginning with k /'s, followed by an o, then followed by 
some string of n — A; — 1 a's or /'s. A similar inequality holds for pi. 
We make the inductive hypothesis: 

l^/^al + \vfkb\ < ^k, for k < n - 1, 

where 5k can be made as small as desired by taking e small enough. We have established 
this for n = 1. 

Applying the triangle inequality to ([l^), we get 



Po < 



k=n-2 
k=Q s 



and using the orthogonality of Sa and Sf and the bound {vju-i^ + Vfn \ < 1 we get 

(k=n-2 \ ^ k=n-2 

Yl \'"f>'as"-k-A \ +2 X! \'"f>'as"-k-A + \Vf^-^a? + \VfA'^, 
k=0,s J k=0,s 

with a similar expression for pi. Applying our inductive hypothesis gives 

k=n-2 /k=n-2 \ ^ 

k=0 \ k=0 I 

where Uk counts the number of strings s for k. If we write this as 

b/"-ial^ + \vf^-^h\' + 2|?;/n|^ > Po +Pi - y 

where y can be made small by choosing e small enough (by the inductive hypothesis), we 
can subtract 



to obtain 

We also get 



> l-e-y. (20) 



l^r-ial + l^r-ifel < V2(e + y), 
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and putting 6n-i = \/2{e + y) establishes the inductive hypothesis. 

Now we reintroduce measurement outcomes in the notation. Asumme pi is determined 
from the measurement Mi. We can write 

Pi = IVfni + ^Vsil"^, (21) 

for i =0 or 1. As we are assuming po + Pi = 2 — e we have Pi > 1 — e for i =0 or 1, and 
hence, taking the square root in (^) 

Using the triangle inequality and adding the results for i =0 and 1 gives 

\vfnQ\ + \vfni\ + I ^ Vso\ + \^Vsi\ > 2 - 2e. 
aj bj 

As the subspaces for Mq and Mi are orthogonal, |t'/noP + ^ li implying |?;/nol + 

\vfni\ < \/2. So we have 

lIZ^'sol + l^^^sil >2- V2-2e. (22) 
Eq. (^ implies Y^aj l^sop + Sb,/ bsiP + |^/" P < 1 so from (pO|) we infer 

We now use the minimisation result that gave us theorem 3.2, namely that, for any vectors 
^i) J2i^i l^jP ^ I Xi\'^/K. Applying this to either of the terms on the left-hand side 
of (ID and (ID gives e + y>{2-V2- 2e)V2^, and by choosing e small enough we can 
ensure that must be as large as we please. 
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